STIC Biotechnology Systems Branch 



RAW SEQUENCE LISTING 
ERROR REPORT 



The Biotechnology Systems Branch of the Scientific and Technical Information 
Center (STIC) detected errors when processing the following computer readable 
form: 



Application Serial Number: 
Source: 

Date Processed by STIC: 




THE ATTACHED PRINTOUT EXPLAINS DETECTED ERRORS. 

PLEASE FORWARD THIS INFORMATION TO THE APPLICANT BY EITHER: 

1) INCLUDING A COPY OF THIS PRINTOUT IN YOUR NEXT COMMUNICATION TO THE 
APPLICANT, WITH A NOTICE TO COMPLY or, 

2) TELEPHONING APPLICANT AND FAXING A COPY OF THIS PRINTOUT, WITH A 
NOTICE TO COMPLY 

FOR CRF SUBMISSION AND PATENTIN SOFTWARE QUESTIONS, PLEASE CONTACT 
MARK SPENCER, TELEPHONE: 571-272-2510; FAX: 571-273-0221 

TO REDUCE ERRORED SEQUENCE LISTINGS, PLEASE USE THE CHECKER 
VERSION 4.4.0 PROGRAM, ACCESSIBLE THROUGH THE U.S. PATENT AND 
TRADEMARK OFFICE WEBSITE. SEE BELOW FOR ADDRESS: 
http://www.uspto.gov/web/offices/pac/checker/chkniote.htm 



Applicants submitting genetic sequence information electronically on diskette or CD-Rom should be aware that there is 

a possibility that the disk/CD-Rom may have been affected by treatment given to all incoming mail. 

Please consider using alternate methods of submission for the disk/CD-Rom or replacement disk/CD-Rom. 

Any reolv including a sequence listing in electronic form should NOT be sent to th e 2023 1 zip code address for the 

United States Patent and Trademark Office, and instead should be sent via the following to the indicated addresses: 

1. EFS-Bio «http://www.usDto.gov/ebc/efs/downloads/documents>htm> , EFS Submission 

User Manual- ePAVE) 

2. U.S. Postal Service: Commissioner for Patents, P.O. Box 1450, Alexandria, VA 223 13-1450 

3. Hand Carry, Federal Express, United Parcel Service, or other delivery service (EFFECTIVE 01/14/05): 
U.S. Patent and Trademark Office, Mail Stop Sequence, Customer Window, Randolph Building, 401 Dulany Street, 
Alexandria, VA 22314 

Revised 01/10/06 



Raw Sequence Listing Error Summary 



ERROR DETECTED 

ATTN: NEW RULES CASES: 

I Wrapped Nudeics 

Wrapped Aminos 



I: 



SUGGESTED CORRECTION SERIAL NUMBER: 

PLEASE DISREGARD ENGLISH "ALPHA" HEADERS, WHICH WERE INSERTED BY PTO SOFTWARE 

The number/text at the end of each line "wrapped" down to the next line. This may occur if your file 
was retrieved in a word processor after creating it Please adjust your right margin to .3; this will 
prevent "wrapping." 



Invalid Line Length The rules require that a line not exceed 72 characters in length. This includes white spaces. 



Misaligned Amino 
Numbering 

Non-ASCII 



^Variable Length 



Patentin 2.0 
" "bug" 



Skipped Sequences 
~(OLD RULES) 



Skipped Sequences 
"(NEW RULES) 



The numbering under each 5 th amino acid is misaligned. Do not use tab codes between numbers; 
use space characters, instead. 

The submitted file was not saved in ASCIl(DOS) text, as required by the Sequence Rules. Please 
ensure your subsequent submission is saved in ASCII text 

Sequence(s) contain n's or Xaa's representing more than one residue. Per Sequence Rules, 

each n or Xaa can only represent a single residue. Please present the maximum number of each 
residue having variable length and indicate in the <220>-<223> section that some may be missing. 

A "bug* in Patentin version 2.0 has caused the <220>-<223> section to be missing from amino acid 

sequences(s) . Normally, Patentin would automatically generate this section from the 

previously coded nucleic acid sequence. Please manually copy the relevant <220>-<223> section to 
the subsequent amino acid sequence. This applies to the mandatory <220>-<223> sections for 
Artificial or Unknown sequences. 

Sequencers) missing. If intentional, please insert the following lines for each skipped sequence: 

(2) INFORMATION FOR SEQ ID NO:X: (insert SEQ ID NO where "X" is shown) 
(i) SEQUENCE CHARACTERISTICS: (Do not insert any subheadings under this heading) 
(xi) SEQUENCE DESCRIPTION :SEQ ID NO:X: (insert SEQ ID NO where "X" is shown) 
This sequence is intentionally skipped 

Please also adjust the M (ii) NUMBER OF SEQUENCES:" response to include the skipped sequences. 



AJscof n's or Xaa's 
(NEW RULES) 



10 



11 



Invalid <213> 



12 



Useof<220> 



Patentin 2.0 
"bug" 



Sequence(s) m 



/missing: If intentional, please insert the following lines for each skipped sequence. 



<210> sequence id number 
<400> sequence id number 
000 

Use of n 's and/or Xaa's have been detected in the Sequence Listing. 

Per 1.823 of Sequence Rules, use of <220>-<223> is MANDATORY if n's or Xaa's are present. 

In <220> to <223> section, please explain location of n or Xaa, and which residue n or Xaa represents. 

Per 1.823 of Sequence Rules, the only valid <2 1 3> responses are: Unknown, Artificial Sequence, or 
scientific name (Genus/species). <220>-<223> section is required when <213> response is Unknown or 
is Artificial Sequence, (see item 1 1 below) 

Sequence(s) missing the <220> "Feature** and associated numeric identifiers and responses. Use 

of <220> to <223> is MANDATORY if <213> Organism" response is "Artificial Sequence" or "Unknown." 
Please explain source of genetic material in <220> to <223> section or use "chemically synthesized" as 
explanation. (See "Federal Register," 06701/1998, Vol. 63, No. 104, pp. 29631-32), also Sec. 1.823 of 
Sequence Rules 

Please do not use "Copy to Disk" function of Patentin version 2.0. This causes a corrupted file, 
resulting in missing mandatory numeric identifiers and responses (as indicated on raw sequence 
listing). Instead, please use "File Manager" or any other manual means to copy file to floppy disk. 



13 



Misuse of n/Xaa "n" can only represent a single nucleotide: "Xaa" can only represent a single amino acid 
AMC - STIC Systems Branch - 03/02/06 



RAW SEQUENCE LISTING DATE: 04/06/2006 

PATENT APPLICATION: US/10/573, 600 TIME: 10:49:35 



C--> 
C--> 



3 
4 
5 
6 
7 
9 
10 
12 
14 
14 
14 
15 
17 
19 
21 
22 
23 
24 
26 
27 
29 
31 
33 
35 



Input Set : A:\UPN-P3230-sequence listing.txt 
Output Set: N:\CRF4\04062006\J573600.raw 

<110> APPLICANT: The Trustees of the University of Pennsylvania 
Wilson, James M. 
Gao, Guangping 
Alvira, Mauricio R. 
Vandenberghe, Luk H. 

<120> TITLE OF INVENTION: Adeno -Associated Virus (AAV) Clades, Sequences, Vectors 

Containing Same, and Uses Therefor 
<130> FILE REFERENCE : UPN-P3230PCT 
<140> CURRENT APPLICATION NUMBER: US/10/573,600 
<141> CURRENT PILING DATE: 2006-03-24 
<150> PRIOR APPLICATION NUMBER: US 60/50B,226 
<151> PRIOR FILING DATE: 2003-09-30 DOOS Not Cftm 

<160> NUMBER OF SEQ ID NOS: 236 COfrenf tst ^ ^ 

<170> SOFTWARE: Patent In version 3.3 Diskette Needed 

<212> TYPE: DNA^-- " " ' AT***' r° CAA^ 

<213> ORGANI SMg ^adeno - as s oc i a t ed virus, clone hu.31^> 
SEQUENCE " — ■ 



4T 



<400> SEQUENCE: T 

atggctgccg atggttatct 
gagtggtggg ctttgaaacc 
aacgctcgag gtcttgtgct 
aagggggagc cggtcaacgc 
cagcagctca aggccggaga 
3^h- eaggagcggc-tcaaagaaga 
39 gccaaaaaga ggcttcttga 



41 
43 
45 
47 
49 
51 
53 
55 
57 
59 
61 
63 
65 
67 
69 
71 
73 



tccagattgg ctcgaggaca accttagtga aggaattcgc 
tggagcccct caacccaagg caaatcaaca acatcaagac 
tccgggttac aaataccttg gacccggcaa cggactcgac 
agcagacgcg gcggccctcg agcacgacaa ggcctacgac 
caacccgtac ctcaagtaca accacgccga cgccgagttc 
-tacgtc tttt gg gggc aacc Lc g g gcgagc agtctt ccag- 
acctcttggt ctggttgagg aagcggctaa gacggctcct 
gcagtctcct caggaaccgg actcctccgc gggtattggc 
taaaaagaga ctcaatttcg gtcagactgg cgacacagag 
aatcggagaa cctcccgcag ccccctcagg tgtgggatct 
tggcgcacca gtggcagaca ataacgaagg tgccgatgga 
gtgggtagtt cctcgggaaa ttggcattgc gattcccaat ggctggggga' cagagtcatc 



ggaaagaaga ggcctgtaga 
aaatcgggtg cacagcccgc 
tcagtcccag accctcaacc 
cttacaatgg cttcaggtgg 



atcacctcta caagcaaacc 
acttcggcta cagcaccccc 
caccacgtga ctggcagcga 
acttcaagct cttcaacatt 
tcgccaataa ccttaccagc 



accaccagca cccgaacctg ggccctgccc acctacaaca 
tccaacagca catctggagg atcttcaaat gacaacgcct 
tgggggtatt ttgacttcaa cagattccac tgccacttct 
ctcatcaaca acaactgggg attccggcct aagcgactca 
caggtcaaag aggttacgga caacaatgga gtcaagacca 
acggtccagg tcttcacgga ctcagactat cagctcccgt acgtgctcgg gtcggctcac 
gagggctgcc tcccgccgtt cccagcggac gttttcatga ttcctcagta cgggtatctg 
acgcttaatg atggaagcca ggccgtgggt cgttcgtcct tttactgcct ggaatatttc 
ccgtcgcaaa tgctaagaac gggtaacaac ttccagttca gctacgagtt tgagaacgta 
cctttccata gcagctacgc tcacagccaa agcctggacc gactaatgaa tccactcatc 
gaccaatact tgtactatct ctcaaagact attaacggtt ctggacagaa tcaacaaacg 
ctaaaattca gtgtggccgg acccagcaac atggctgtcc agggaagaaa ctacatacct 



60 
120 
180 
240 
300 
—360- 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
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RAW SEQUENCE LISTING 
' PATENT APPLICATION: US/10/573,600 



DATE: 04/06/2006 
TIME: 10:49:35 



Input Set : A:\OPN-P3230-eequence listing 
Output Set: N:\CRF4\04062006\J573600.raw 



. txt 



75 ggacccagct accgacaaca acgtgtctca accactgtga ctcaaaacaa caacagcgaa 1500 

77 tttgcttggc ctggagcttc ttcttgggct ctcaatggac gtaatagctt gatgaatcct 1560 

79 ggacctgcta tggccagcca caaagaagga gaggaccgtt tctttccttt gtctggatct 1620 

61 ttaatttttg gcaaacaagg aactggaaga gacaacgtgg atgcggacaa agtcatgata 1680 

83 accaacgaag aagaaattaa aactactaac ccggtagcaa cggagtccta tggacaagtg 1740 

85 gccacaaacc accagagtgc ccaagcacag gcgcagaccg gctgggttca aaaccaagga 1800 

87 atacttccgg gtatggtttg gcaggacaga gatgtgtacc tgcaaggacc catttgggcc 1860 

89 aaaattcctc acacggacgg caactttcac ccttctccgc tgatgggagg gtttggaatg 1920 

91 aagcacccgc ctcctcagat cctcatcaaa aacacacctg tacctgcgga tcctccaacg 1980 

93 gccttcaaca aggacaagct gaactctttc atcacccagt attctactgg ccaagtcagc 2040 

95 gtggagatcg agtgggagct gcagaaggaa aacagcaagc gctggaaccc ggagatccag 2100 

97 tacacttcca actattacaa gtctaataat gttgaatttg ctgttaatac tgaaggtgta 2160 

99 tatagtgaac cccgccccat tggcaccaga tacctgactc gtaatctgta a 2211 

102 <210> SEQ ID NO: 2 

103 <211> LENGTH: 2211 _, 

104 <2I2> TYPE: DNA^T ^ 

105 <213> ORGANISM a^new AAV serotype, clone hu.32 J) 

107 <400> SEQUENCE: 2 — ■ ■ 

108 atggctgccg atggttatct tccagattgg ctcgaggaca ctctctctga aggaataaga 60 
110 cagtggtgga agctcaaacc tggcccacca ccaccaaagc ccgcagagcg gcataaggac 120 
112 gacagcaggg gtcttgtgct tcctgggtac aagtacctcg gacccggcaa cggactcgac 180 
114 aagggggagc cggtcaacgc agcagacgcg gcggccctcg agcacgacaa ggcctacgac 240 
116 cagcagctca aggccggaga caacccgtac ctcaagtaca accacgccga cgccgagttc 300 
118 caggagcggc tcaaagaaga tacgtctttt gggggcaacc tcgggcgagc agtcttccag 360 
120 gccaaaaaga ggcttcttga acctcttggt ctggttgagg aagcggctaa gacggctcct 420 
122 ggaaagaaga ggcctgtaga gcagtctcct caggaaccgg actcctccgc gggtattggc 480 
124 aaatcgggtt cacagcccgc taaaaagaaa ctcaatttcg gtcagactgg cgacacagag 540 
126 tcagtccccg accctcaacc aatcggagaa cctcccgcag ccccctcagg tgtgggatct 600 
128 cttacaatgg cttcaggtgg tggcgcacca gtggcagaca ataacgaagg tgccgatgga 660 
130 gtgggtagtt cctcgggaaa ttggcattgc gattcccaat ggctggggga cagagtcatc 720 
132 accaccagca cccgaacctg ggccctgccc acctacaaca atcacctcta caagcaaatc 780 
134 tccaacagca catctggagg atcttcaaat gacaacgcct acttcggcta cagcaccccc 840 
136 tgggggtatt ttgacttcaa cagattccac tgccacttct caccacgtga ctggcagcga 900 
138 ctcatcaaca acaactgggg attccggcct aagcgactca acttcaagct cttcaacatt 960 
140 caggtcaaag aggttacgga caacaatgga gtcaagacca tcgccaataa ccttaccagc 1020 
142 acggtccagg tcttcacgga ctcagactat cagctcccgt acgtgctcgg gtcggctcac 1080 
144 gagggctgcc tcccgccgtt cccagcggac gttttcatga ttcctcagta cgggtatctg 1140 
146 acgcttaatg atgggagcca ggccgtgggt cgttcgtcct tttactgcct ggaatatttc 1200 
148 ccgtcgcaaa tgctaagaac gggtaacaac ttccagttca gctacgagtt tgagaacgta 1260 
150 cctttccata gcagctacgc tcacagccaa agcctggacc gactaatgaa tccactcatc 1320 
152 gaccaatact tgtactatct ctcaaagact attaacggtt ctggacagaa tcaacaa&cg 1330 
154 ctaaaattca gcgtggccgg acccagcaac atggctgtcc agggaagaaa ctacatacct 1440 
156 ggacccagct accgacaaca acgtgtctca accactgtga ctcaaaacaa caacagcgaa 1500 
158 tttgcttggc ctggagcttc ttcttgggct ctcaatggac gtaatagctt gatgaatcct 1560 
X60 ggacctgcta tggccagcca caaagaagga gaggaccgtt tctttccttt gtctggatct 1620 
162 ttaatttttg gcaaacaagg aactggaaga gacaacgtgg atgcggacaa agtcatgata 1680 
164 accaacgaag aagaaattaa aactactaac ccggtagcaa cggagtccta tggacaagtg 1740 
166 gccacaaacc accagagtgc ccaagcacag gcgcagaccg gctgggttca aaaccaagga 1800 
168 atacttccgg gtatggtttg gcaggacaga gatgtgtacc tgcaaggacc catttgggcc 1860 
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HAW SEQUENCE LISTING DATE: 04/06/2006 

* PATENT APPLICATION: US/10/573,600 TIME: 10:49:35 

Input Set : A:\UPN-P3230-Bequence lieting.txt 
Output Set: N:\CRP4\04O62OO6\J573600 .raw 

170 aaaattcctc acacggacgg caactttcac ccttctccgc taatgggagg gtttggaatg 1920 
172 aagcacccgc ctcctcagat cctcatcaaa aacacacctg tacctgcgga tcctccaacg 1980 
174 gctttcaata aggacaagct gaactctttc atcacccagt attctactgg ccaagtcagc 2040 
176 gtggagattg agtgggagct gcagaaggaa aacagcaagc gctggaaccc ggagatccag 2100 
178 tacacttcca actattacaa gtctaataat gttgaatttg ctgttaatac tgaaggtgta 2160 
180 tatagtgaac cccgccccat tggcaccaga tacctgactc gtaatctgta a 2211 

183 <210> SEQ ID NO: 3 

184 <211> LENGTH: 2211 

185 <212> TYPE: DNA 

186 <213> ORGAN I SM:^adeno- associated virus, human clone 9 
188 <400> SEQUENCE: 




189 atggctgccg atggttatct tccagattgg ctcgaggaca accttagtga aggaattcgc 60 

191 gagtggtggg ctttgaaacc tggagcccct caacccaagg caaatcaaca acatcaagac 120 

193 aacgctcgag gtcttgtgct tccgggttac aaataccttg gacccggcaa cggactcgac 180 

195 aagggggagc cggtcaacgc agcagacgcg gcggccctcg agcacgacaa ggcctacgac 240 

197 cagcagctca aggccggaga caacccgtac ctcaagtaca accacgccga cgccgagttc 300 

199 caggagcggc tcaaagaaga tacgtctttt gggggcaacc tcgggcgagc agtcttccag 360 

201 gccaaaaaga ggcttcttga acctcttggt ctggttgagg aagcggctaa gacggctcct 420 

203 ggaaagaaga ggcctgtaga gcagtctcct caggaaccgg actcctccgc gggtattggc 480 

205 aaatcgggtg cacagcccgc taaaaagaga ctcaatttcg gtcagactgg cgacacagag 540 

207 tcagtcccag accctcaacc aatcggagaa cctcccgcag ccccctcagg tgtgggatct 600 

209 cttacaatgg cttcaggtgg tggcgcacca gtggcagaca ataacgaagg tgccgatgga 660 

211 gtgggtagtt cctcgggaaa ttggcattgc gattcccaat ggctggggga cagagtcatc 720 

213 accaccagca cccgaacctg ggccctgccc acctacaaca atcacctcta caagcaaatc 780 

215 tccaacagca catctggagg atcttcaaat gacaacgcct acttcggcta cagcaccccc 840 

217 tgggggtatt ttgacttcaa cagattccac tgccacttct caccacgtga ctggcagcga 900 

219 ctcatcaaca acaactgggg attccggcct aagcgactca acttcaagct cttcaacatt 960 

221 caggtcaaag aggttacgga caacaatgga gtcaagacca tcgccaataa ccttaccagc 1020 

223 acggtccagg tcttcacgga ctcagactat cagctcccgt acgtgctcgg gtcggctcac 1080 

225 gagggctgcc tcccgccgtt cccagcggac gttttcatga ttcctcagta cgggtatctg 1140 

227 acgcttaatg atggaagcca ggccgtgggt cgttcgtcct tttactgcct ggaatatttc 1200 

229 ccgtcgcaaa tgctaagaac gggtaacaac ttccagttca gctacgagtt tgagaacgta 1260 

231 cctttccata gcagctacgc tcacagccaa agcctggacc gactaatgaa tccactcatc 1320 

233 gaccaatact tgtactatct ctcaaagact attaacggtt ctggacagaa tcaacaaacg 1380 

235 ctaaaattca gtgtggccgg acccagcaac atggctgtcc agggaagaaa ctacatacct 1440 

237 ggacccagct accgacaaca acgtgtctca accactgtga ctcaaaacaa caacagcgaa 1500 

239 tttgcttggc ctggagcttc ttcttgggct ctcaatggac gtaatagctt gatgaatcct 1560 

241 ggacctgcta tggccagcca caaagaagga gaggaccgtt tctttccttt gtctggatct 1620 

243 ttaatttttg gcaaacaagg aactggaaga gacaacgtgg atgcggacaa agtcatgata 1680 

245 accaacgaag aagaaattaa aactactaac ccggtagcaa cggagtccta tggacaagtg 1740 

247„gccacaaacc^ccagagtgc^c caagcacag_ gogoagaccg^gctsggttca-^aaaccaagga 1800- 

249 atacttccgg gtatggtttg gcaggacaga gatgtgtacc tgcaaggacc catttgggcc 1860 

251 aaaattcctc acacggacgg caactttcac ccttctccgc tgatgggagg gtttggaatg 1920 

253 aagcacccgc ctcctcagat cctcatcaaa aacacacctg tacctgcgga tcctccaacg 1980 

255 gccttcaaca aggacaagct gaactctttc atcacccagt attctactgg ccaagtcagc 2040 

257 gtggagatcg agtgggagct gcagaaggaa aacagcaagc gctggaaccc ggagatccag 2100 

259 tacacttcca actattacaa gtctaataat gttgaatttg ctgttaatac tgaaggtgta 2160 

261 tatagtgaac cccgccccat tggcaccaga tacctgactc gtaatctgta a 2211 
264 <210> SEQ ID NO: 4 
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RAW SEQUENCE LISTING DATE: 04/06/2006 

PATENT APPLICATION: US/10/573,600 TIME : 10:49:35 

Input Set : A:\UPN-P3230-aequence listing.txt 
Output Set: N:\CRF4\04062006\J573600.raw 




265 <211> LENGTH: 2217 

266 <212> TYPE: DNA 

267 <213> ORGANISM; 

269 <400> SEQUENCES _______ _ 

270 atggctgccg atggttatct tccagatt£g-<2tcgaggaca acctctctga gggcattcgc 60 
272 gagtggtggg acttgaaacc tggagccccg aaacccaaag ccaaccagca aaagcaggac 120 
274 gacggccggg gtctggtgct tcctggctgc aagtacctcg gacccttcaa cggactcgac 180 
276 aagggggagc ccgtcaacgc ggcggacgca gcggccctcg agcacgacaa ggcctacgac 240 
278 cagcagctca aagcgggtga caatccgtac ctgcggtata accacgccga cgccgagttt 300 
280 caggagcgtc tgcaagaaga tacgtctttt gggggcaacc tcgggcgagc agtcttccag 360 
282 gccaagaagc gggttctcga acctctcggt ctggttgagg aaggcgctaa gacggctcct 420 
284 ggaaagaaga gaccggtaga gccatcaccc cagcgttctc cagactcctc tacgggcatc 480 
286 ggcaagacag gccagcagcc cgcgaaaaag agactcaact ttgggcagac tggcgactca 540 
288 gagtcagtgc ccgaccctca accaatcgga gaaccccccg caggcccctc tggtctggga 600 
290 tctggtacaa tggctgcagg cggtggcgct ccaatggcag acaataacga aggcgccgac 660 
292 ggagtgggt* gfctcctcagg asattggcat tgcgattcca catggctggg cgacagagtc 720 
294 atcaccacca gcacccgaac ctgggccctc cccacctaca acaaccacct ctacaagcaa 780 
296 atctccaacg ggacatcggg aggaagcacc aacgacaaca cctacttcgg ctacagcacc 840 
298 ccctgggggt attttgactt taacagattc cactgccact tctcaccacg tgactggcag 900 
300 cgactcatca acaacaactg gggattccgg cccaagagac tcaacttcaa gctcttcaac 960 
302 atccaggtca aggaggtcac gcagaatgaa ggcaccaaga ccatcgccaa taaccttacc 1020 
304 agcacgattc aggtctttac ggactcggaa taccagctcc cgtacgtcct cggctctgcg 1080 
306 caccagggct gcccgcctcc gttcccggcg gacgtcttca tgattcctca gtacgggtac 1140 
308 ctgactctga acaacggcag tcaggccgtg ggccgttcct ccttctactg cctggagtac 1200 
310 tttccttctc aaatgcggag aacgggcaac aactttgagt tcagctacca gtttgaggac 1260 
312 gtgccttttc acagcagcta cgcgcatagc caaagcctgg accggctgat gaaccccctc 1320 
314 atcgaccagt acctgtacta cctgtctcgg actcagtcca cgggaggtac cgcaggaact 1380 
316 cagcagttgc tattttctca ggccgggcct aataacatgt cggctcaggc caaaaactgg 1440 
318 ctacccgggc cctgctaccg gcagcaacgc gtctccacga cactgtcgca aaataacaac 1500 
320 agcaactttg cttggaccgg tgccaccaag tatcatctga atggcagaga ctctctggta 1560 
322 aatcccggtg tcgctatggc aacgcacaag gacgacgaag agcgattttt tccatccagc 1620 
324 ggagtcttga tgtttgggaa acagggagct ggaaaagaca acgtggacta tagcagcgtt 1680 
326 atgctaacca gtgaggaaga aatcaaaacc accaacccag tggccacaga acagtacggc 1740 
328 gtggtggccg ataacctgca acagcaaaac gccgctccta ttgtaggggc cgtcaacagt 1800 
330 caaggagcct tacctggcat ggtctggcag aaccgggacg tgtacctgca gggtcctatc 1860 
332 tgggccaaga ttcctcacac ggacggcaac tttcatcctt cgccgctgat gggaggcttt 1920 
334 ggactgaaac acccgcctcc tcagatcctg attaagaata cacctgttcc cgcggatcct 1980 
336 ccaactacct tcagtcaagc caagctggcg tcgttcatca cgcagtacag caccggacag 2040 
338 gtcagcgtgg aaattgaatg ggagctgcag aaagagaaca gcaagcgctg gaacccagag 2100 
340 attcagtata cttccaacta taacaaatct gttaatgtgg actttactgt ggacactaat 2160 
342 ggtgtgtatt cagagcctcg ccccattg gc accagatacc tg actcgtaa tctgtaa 2217 

345 <210> SEQ ID NO: 5 

346 <211> LENGTH: 2217 

347 <212> TYPE: DNA 

348 <213> ORGANISM^new AAV 

350 <400> SEQUENCE : 5 

351 atggctgccg atggttatct tccagattgg ctcgaggaca acctctctga gggcattcgc 60 
353 gagtggtggg acttgaaacc tggagccccg aaacccaaag ccaaccagca aaagcaggac 120 
355 gacggccggg gtctggtgct tcctggctac aagtacctcg gacccttcaa cggactcgac 180 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: 0S/1O/573 , 600 



DATE: 04/06/2006 
TIME: 10:49:35 



Input Set : A:\UPN-P3230-eequence lieting.txt 
Output Set: Na\CRP4\04062006\J573600.raw 



363 
365 



357 aagggggagc ccgtcaacgc ggcggacgca 
359 cagcagctca aagcgggtga caatccgtac 
361 caggagcgtc tgcaagaaga tacgtctttt 
gccaagaagc gggttctcga acctctcggt 
ggaaagaaga gaccggtaga gccatcaccc 
367 ggcaagacag gccagcagcc cgcgaaaaag 
369 gagtcagtgc ccgaccctca accaatcgga 
371 tctggtacaa tggctgcagg cggtggcgct 
373 ggagtgggta gttcctcagg aaattggcat 
375 atcaccacca gcacccgacc ctgggccctc 
377 atctccaacg ggacatcggg aggaagcacc 
379 ccctgggggt attttgactt taacagattc 
381 cgactcatca acaacaactg gggattccgg 
383 atccaggtca aggaggtcac gcagaatgaa 
385 agcacgattc aggtctttac ggactcggaa 
387 caccagggct gcccgcctcc gttcccggcg 
389 ctgactctga acaacggcag tcaggccgtg 
391 tttccttctc aaatgcggag aacgggcaac 
393 gtgccttttc acagcagcta cgcgcatagc 
395 atcgaccagt acctgtacta cctgtctcgg 
397 cagcagttgc tattttctca ggccgggcct 
399 ctacccgggc cctgctaccg gcagcaacgc 
401 agcaactttg cttggaccgg tgccaccaag 
403 aatcccggtg tcgctatggc aacgcacaag 
405 ggagtcttga tgtttgggaa acagggagct 
407 atgctaacca gtgaggaaga aatcaaaacc 
409 gtggtggccg ataacctgca acagcaaaac 
411 caaggagcct tacctggcat ggtctggcag 
413 tgggccaaga ttcctcacac ggacggcaac 
415 ggactgaaac acccgcctcc tcagatcctg 
417 ccaactacct tcagtcaagc caagctggcg 
419 gtcagcgtgg aaattgaatg ggagctgcag 
421 attcagtata cttccaacta ctacaaatct 
423 ggtacttatt cagagcctcg ccccattggc 

426 <210> SEQ ID NO: 6 

<211> LENGTH: 2217 
<212> TYPE: Dl 

429 <213> ORGANi^fi: new AAV serotype, 
431 <400> SEQUEN 



gcggccctcg 
ctgcggtata 
gggggcaacc 
ctggttgagg 
cagcgttctc 
agactcaact 
gaaccccccg 
ccaatggcag 
tgcgattccg 
cccacctaca 
aacgacaaca 
cactgccact 
cccaagagac 
ggcaccaaga 
taccagctcc 
gacgtcttca 
ggccgttcct 
aactttgagt 
caaagcctgg 
actcagtcca 
aataacatgt 
gtctccacga 
tatcatctga 
gacgacgaag 
ggaaaagaca 
accaacccag 
gccgctccta 
aaccgggacg 
tttcatcctt 
attaagaata 
tcgttcatca 
aaagagaaca 
acaaatgtgg 
acccgttacc 



agcacgacaa 
accacgccga 
tcgggcgagc 
aaggcgctaa 
cagactcctc 
ttgggcagac 
caggcccctc 
acaataacga 
catggctggg 
acaaccacct 
cctacttcgg 
tctcaccacg 
tcaacttcaa 
ccatcgccaa 
cgtacgtcct 
tgattcctca 
ccttctactg 
tcagctacca 
accggctgat 
cgggaggtac 
cggctcaggc 
cactgtcgca 
atggcagaga 
agcgattttt 
acgtggacta 
tggccacaga 
ttgtaggggc 
tgtacctgca 
cgccgctgat 
cacctgttcc 
cgcagtacag 
gcaagcgctg 
actttgctgt 
tcacccgtaa 



ggcctacgac 
cgccgagttt 
agtcttccag 
gacggctcct 
tacgggcatc 
tggcgactca 
tggtctggga 
aggcgccgac 
cgacagagtc 
ctacaagcaa 
ctacagcacc 
tgactggcag 
gctcttcaac 
taaccttacc 
cggctctgcg 
gtacgggtac 
cctggagtac 
gtttgaggac 
gaaccccctc 
cgcaggaact 
caaaaactgg 
aaataacaac 
ctctctggta 
tccatccagc 
tagcagcgtt 
acagtacggc 
cgtcaacagt 
gggtcctatc 
gggaggcttt 
cgcggatcct 
caccggacag 
gaacccagag 
caatactgag 
cctgtaa 



427 
428 




clone hu.41 



432 atggctgctg acggttatct tccagattgg 
434 gagtggtggg scctgaaacc tggagccecc. 
436 gacggccggg gtctggtgct tcctggctac 
438 aagggggagc ccgtcaacgc ggcggacgca 
440 cagcagctca aagcgggtga caatccgtac 
442 caggagcgtc tacaagaaga tacgtctttt 
444 gccaagaagc gggttctcga acctctcggt 
446 ggaaagaaga gaccggtaga accgccacct 
448 ggcaagaaag gccagcagcc cgctaaaaag 
450 gagtcagtcc ccgaccctca accaatcgga 



ctcgaggaca acctctctga 
aagcccaagg ccaaccagca 
aagtacctcg gacccttcaa 
gcggccctcg agcacgacaa 
ctgcggtata accacgccga 
gggggcaacc tcgggcgagc 
ccggttgagg aagctgctaa 
cagcgttccc ccgactcctc 
agactgaact ttggtcagac 
gaaccaccag caggcccctc 



gggcattcgc 
gaagcaggac 
cggactcgac 
ggcctacgac 
cgccgagttt 
agtcttccag 
gacggctcct 
cacgggcatc 
tggcgactca 
tggtctggga 



240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1600 
1860 
1920 
1980 
2040 
2100 
2160 
2217 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 



PJL~ 
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VERIFICATION SUMMARY DATE: 04/06/2006 

* PATENT APPLICATION: US/10/573 , 600 TIME: 10:49:36 

Input Set : A* \UPN-P3230 -sequence listing.txt 
% ' 1 " Output Set: N:\CRP4\04062006\J573600.raw 

L:14 M:270 C: Current Application Number differs. Replaced Current Application No 
L:14 M:271 C: Current Filing Date differs, Replaced Current Filing Date 
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